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On the concentration of eigenvalues of random symmetric matrices 

Michael Krivelevich* Van H. Vu ^ 

O' 

o: 

^NJ . Abstract 

JJT^' We prove that the few largest (and most important) eigenvalues of random symmetric matrices 

^^ . of various kinds are very strongly concentrated. This strong concentration enables us to compute 

. I the means of these eigenvalues with high precision. Our approach uses Talagrand's inequality and 

^Sj ' is very different from standard approaches. 

^ ! 1 Introduction 

en . 

^^ . In tliis paper we consider the eigenvalues of random symmetric matrices whose diagonal and upper 

^s^ , diagonal entries are independent random variables. Our goal is to study few largest /smallest eigen- 

^^ I values of such a matrix. Let us begin with a version of Wigner's famous semi-circle law [T^, due to 

^^ I Arnold [Q, g], which describes the limiting behavior of the bulk of the spectrum of a random matrix 

'^ I of this type. 

r^ ' Semi-circle law. For 1 < i < j < n let aij be real value random variables such that all aij, i < j 

C^ ' have the same distribution and all an have the same distribution. Assume that all central moments of 

ti ■ the aij are finite and put a^ = cr'^(aij). For i < j set aji = a^j and let A^ denote the random matrix 

*> • {o,ij)i- Finally, denote by Wn{x) the number of eigenvalues of A^ not larger than x, divided by n. 

Then 
H : lim Wn{x2a^/^) = Wix) , 

in distribution, where W{x) = if x < —l,W{x) = 1 if x > 1 and W{x) = ^ J^^{1 — x^^'^dx if 
-1 < x < 1. 

The semi-circle law gives only a limit distribution and does not tell anything about the behavior 
of the largest /smallest (and usually most important) eigenvalues. These eigenvalues were studied in 
several papers [^, ^ ^, |9|. The method used in these papers is to estimate the expectation of the trace 
of a high power of the matrix. This frequently leads to a sharp upper bound on the largest eigenvalue 
(see Section 2). 

Given a symmetric matrix A, we denote by Si{A) > 52{A) > . . . > SniA) the eigenvalues of A. 
Furthermore, let Ai(^) = max^^i(|(5,A)|) = max(|(5i(A)|, |(5„(^)|) and X2{A) = max(|(52(A)|, |(5n(^)|)- 
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The purpose of this paper is to prove large deviation bounds for Ai, A2, (^i, 62 and (5„. We beheve 
that these results are of interest for a number of reasons. The first is that our results are obtained 
under a very general assumption on the distribution of the entries of a random symmetric matrix. 
Secondly, our large deviation bounds turn out to be very strong. Moreover, they are sharp, up to a 
constant in the exponent, in a certain deviation range. Also, our method appears to be new; it makes 
a novel application of the recent and powerful inequality of Talagrand |1^]. Finally, since bounds 
on the largest eigenvalues of a symmetric random matrix are widely used in many applications in 
Combinatorics and Theoretical Computer Science, we believe that our results have a potential in 
these areas. As an example of such an application, we would like to mention a paper ||5[ of the 
present authors, where a version of our theorems has been used to design approximation algorithms 
with expected polynomial running time for such important computational problems as finding the 
chromatic number and the independence number of a graph. 

Our first result involves the following general model. Let aij (1 < i < j < n) be independent 
random variables, with absolute value at most 1. A symmetric random matrix A is obtained by 
defining Ojj = aij for all i < j. 

Theorem 1 There are positive constants c and K such that for any t > K , 

Pr[\\M)-E{\i{A))\>t\<e-''" . 

The same result holds for both 6i{A) and 6n{A). 

The bound in Theorem |l| is sharp, up to the constant c, when t is sufficiently large. The surprising 
fact about this theorem is that it requires basically no knowledge about the distributions of the aij. 

Our second theorem provides a large deviation result for the second largest eigenvalue of a sym- 
metric random matrix A, under the additional assumption that all non-diagonal entries of A have the 
same expectation p > 0. 

Theorem 2 For every constant p > there exists constants Cp,Kp > so that the following holds. If 
in addition to the conditions of Theorem^ the random variables aij, I < i < j < n satisfy E[aij] = p, 
then for all t > Kp, 

Pr[X2{A) - E{\2{A))\ >t]<e-^''*' . 

The same result holds for 52{A). 

One particular application of the above theorem arises when all diagonal entries of A are 0, and 
each non-diagonal entry of yl is a Bernoulli random variable with parameter p, i.e. Pr[aij = 1] = p, 
Pr[aij = 0] = 1 — p. In this case A can be viewed as the adjacency matrix of the random graph 
G{n,p). Thus Theorem]^ provides in this case a large deviation result for the second eigenvalue of a 
random graph. In fact, for this special case. Theorem |2| can be extended for p decreasing in n (see 
Section 5). 

The rest of the paper is organized as follows. In the next section, we collect some information 
about the expectations of the eigenvalues in concern. More interesting, it turns out that our theorems 



can sometimes be used to estimate these expectations. The proofs of Theorems |l| and |2| appear in 
Sections 3 and 4, respectively. We end with Section 5, which contains few remarks and open questions. 
In what fohows, a matrix is always symmetric and of order n, if not otherwise specified. We 
assume that n tends to infinity and the asymptotic notations (such as o, O, etc) are understood 
under this assumption. The letter c denotes a positive constant, whose value may vary in different 
occurrences. Bold lower case letters such as x, y denote vectors in EP and xy is the inner product of 
X and y. Given a matrix A, xAy is the inner product of x and Ay. 1 is the all one vector. 

2 Expectations 

In this section, we present several results about the expectation of the relevant eigenvalues. We also 
show that our theorems can be used to determine these expectations in some cases. 

Let Ojj, ^ < j be independent random variables bounded in their absolute values by 1. Assume 
that for i < j, the aij have common expectation p and variance a'^. Furthermore, assume that 
-E'iflM] = i^ for all i. Fiiredi and Komlos (Q, Theorem 1), showed that if p > then 

^[Ai (A)] = {n-l)p + i^ + a^p + o(l) . (1) 

Also, in this case under a weaker assumption yAi?[ajj] < o"^ for all 1 < i < j < n the argument of 
Fiiredi and Komlos gives: 

^[A2(^)] < 2a^/^ + 0{n^/^ log n) . (2) 

The situation changes when p = 0. In the same paper, Fiiredi and Komlos (implicitly) showed 
that in this case (again assuming only T/yli?[ajj] < a'^) 

E[Xi{A)] < 2a^/^ + 0{n^/^logn) . (3) 

Fiiredi and Komlos also claimed that if Var[aij] = a"^ then with probability tending to 1, Ai(yl) > 
2ay/n + 0{v}/^ log n). 

Using our Theorem |l[ we first show that a statement slightly weaker than ([l|) holds under a more 
general assumption that the variances are upper bounded by a^, but are not necessarily equal. Next, 
we prove a lower bound stronger than that stated by Fiiredi and Komlos. 

Corollary 2.1 If all entries aij of the random symmetric matrix A = {oij) are hounded in absolute 
value by 1, and all non-diagonal entries have common expectation p > 0, then 

E[\i{A)] =np + 0{V^) . 

Proof. For each entry Oij, one can define a random variable Qj, satisfying \cij\ < 1, -E[cjj] = 0, 
FAi?[cij] = 1 — VAR[aij]. Let now bij = aij — Cij. Then clearly E[bij] = p, VAR[bij] = 1. Denote 
B = (bij), C = dj, then A = B + C. Hence Ai(^) < Ai(S) + Ai(C). Applying 1^, (|) and Theorem 
H, we obtain: 

Pr[Xi{B)<np + 0{l)] > ^, 

Pr[X^{C) < 0{V^)] > |, 



and thus Pr[Xi{A) = np + 0{^fn)\ > 1/2. Invoking Theorem |l| once again, we get the desired result. 



D 



By the same argument, one can show that if o" = uiji ^''^), then E{5n) = 2(Tn -^'^(1 + o(l)). 



Corollary 2.2 If all entries aij of the random symmetric matrix A = (aij) have common expectation 
and variance a'^ , then 

E[Xi{A)] > 2fjn^/2 ^ o(iogi/2^) . 
Consequently, with probability tending to 1, 



\i{A) > 2(7ni/2 + 0(logi/ 



^n) 



Proof. For the sake of simplicity, we assume a = 1/2. Furthermore, set // = n^'"^, k = [/^log ' n] 
and X = a log ' n, where a is a positive constant chosen so that the following two inequalities hold: 



/^ 



V^^^^ > 2(// - x/2)'' (4) 



^2tloKi/2n-ct2 



^ e^*^"^^'^"-'^* =0(1), (5) 

i=|logl/2„ 

where c is the constant in Theorem 1. Without loss of generality, we assume that k is an even integer 
and let X be the trace of A''. It is trivial that E[X] < nii^[Aj]. On the other hand, a simple counting 
argument (see ^) shows that 

It follows that 

^1^'' ^ Whri (k/2) "'O - 1) ■ ■ ■ (n - (k/2)) > //«=»'^ . (6) 

Assume, for contradiction, that -E(Ai) < /z — x. It follows from this assumption that 

00 
E[X^i] <{fi- xl2f + ^ (^ - X + (t + l))*^Pr[Ai > ^ - x + t] . (7) 

t=a:/2 

By Theorem |l], Pr(Ai > ^ - x + 1) < e"^*" for all t > x/2. Thus (|),(|) and (0) imply 

00 
^ (m - X + (t + l))'=e-'^*' > //A;5/2 - (^ - x/2)'= > (^ - x/2)^ (8) 

t=x/2 

Since (/i - x + (t + 1))V(m - x/2)^ < e(i+°(i))WM = e(i+o(i))ti°g'^'«, (|) and (|) imply a contra- 
diction, and this completes the proof. □ 



To end this section, let us mention few recent results of Sinai and Soshnikov. In |^, Sinai and 
Soshnikov showed that if aij have symmetric distributions and their moments satisfy some mild 
assumptions, then Pr[Xi{A) < 2a ^/n^ 0(1)] = 1 — o(l). They also stated that a similar result would 
hold without the symmetric assumption. Furthermore, Soshnikov proved in [So] that under the same 
assumptions about Ojj, the joint distribution of the k dimensional random vector formed by the first 
k eigenvalues, scaled properly, tends to a weak limit, for any fixed k. 

3 Proof of Theorem |I] 




The key tool of the proof is a powerful concentration result, due to Talagrand |10|. To state this 
inequality, we first need to define the so-called Talagrand distance in a product space. Let Qi, . . . , fi^ 
be probability spaces, and let fi denote their product space. Fix a set ^ C Jl and a point x = 
(xi, . . . ,2;^) S ^- We say that x has Talagrand distance t from ^ if i is the smallest number such 
that the following holds. For any real vector a = (ai, . . . , am), there is a point y = (yi, . . . Hm) £ -4 
such that 

Let At denote the set of all points with Talagrand distance at most t from A. Talagrand proved 
that for any t > 0, 

Pr[^]Pr[A] < e-*'/^ 

where At denotes the complement of At- Remarkably, the rather abstract and difficult definition of 
the Talagrand distance suits our problem perfectly, as shown in the proof below. 

Consider the product space spanned by aij,l < i < j < n. A vector in this space corresponds to 
a random matrix. Let m, be a median of Ai and let A be the set of all matrices (vectors) T such that 
Ai(^) < rn. By definition, Pr[A] > 1/2. By a well known fact in linear algebra 

n 

Ai(^) = max V" (viWj + VjWi)tij + y2viWitii . 

Il^il = ll^ll=ll<i<i<n i=l 

Consider a matrix X = (xij) where Ai(X) > m + t. There are vectors v = (vi, . . . ,Vn), w" = 
{wi, . . . , Wn) with norm 1 such that 

n 

vXw = ^ {viWj + VjWi)xij + ^ ViWiXii >m + t. 

l<i<j<n i=l 

On the other hand, for any Y = {yij) G A 

n 

vYw = ^ {viWj + VjWi)yij + ^ ViWiVii < m . 

l<i<j<n i=l 



Set Qy = (viWj + VjWi) for 1 < i < j < n and an = viWi for 1 < i < n. It is easy to show that 

Y: al<2{Y: vf){±w])=2. 

l<i<j<n l<i<n i=l 

Since \xij — yij\ < 2, we have 

1/2 



t 



a 



2 






By definition, it follows that X G -^t/Vs- Therefore, by Talagrand's inequality 

Pr[Xi{A) >m + t]< 2e-*'/22 _ (g) 

Let B be the set of A such that Ai(^) < m — t. By a similar argument, one can show that if Xi{A) > m 
then A G -Sw /g. Recall that Pr[\i{A) > m] > 1/2. Thus Talagrand's inequality implies 

Pr[\i{A) <m-t\< 2e-*'/^2 . (10) 

^Prom here, one can derive that the difference between the median and the expectation of Ai is 
bounded by a constant: 

\E{\i{A))-m\ < ^(|Ai-m|)</ tPr[\\i{A) - m\ > t]dt 

Jo 

< / Ate-' /^^dt = 64 . (11) 

Jo 

Inequalities @ , (|lO| ) and ([ll|) together imply the desired deviation bound for Ai (A) . The statements 
involving di{A) and 6n{A) can be proved in a similar way, using the following equalities: 

6i(A) = max x^x. 

x,llx|l=l 

5n{A) = min x^x . 

x,j|x|j=l 

The sharpness of the result. The following example shows that the bound in Theorem || is best 
possible, up to a multiplicative constant in the exponent. 

Assume that aij, 1 < i < j < n, have the following distribution: Uij = 1 with probability p and 
aij = —p/q with probability q = 1 — p. A matrix is fat if it contains an all 1 principle sub-matrix of 
size -E[Ai] + 1. It is trivial that if A is fat then Ai(j4) > £'[Ai] + 1. On the other hand, the probability 

that a matrix is fat is at least p(^['^i]+*) = e"'- ^ ^^ ' °^p . Thus, if p is a positive constant and t is 
of order r2(ii^[Ai]), then 

Pr[\Xi-E[Xi]\>t]>e-'^'\ 

for some positive constant c. 



4 Proof of Theorem @ 

Given a symmetric matrix A, X2{A) can be expressed as follows 



AofA) = min max xAy 

l|x|| = l|y||=i 

XV— yv— 



Define 

/i2(A) = max xAy. 

xl=yl=0 

l|x|N||y|Nl 

It is clear that /X2(yl) > A2(j4) for any matrix A. In the rest of the proof, we use shorthands fi2, A2 for 
^2(^)5 '^2(^)1 respectively, where A is distributed as described in the theorem formulation. Similar 
to the previous section, by Talagrand's inequality we can show 

Lemma 4.1 There are positive constants c and K such that for any t > K 

Pr[\^Ji2-E{^i2)\>t\<e-'''" . 

Set A' = A — pJn, where Jn denotes the all one matrix of order n. It is easy to show that 
M^) < Ai(A'). Indeed, 

uo = max x(A' + pJn)Y = max xA'y 

x,y ^ ^ x,y 

xl=yl=0 xl=yl=0 

||63;|| = ||y|| = l l|6a:|| = ||y||=l 

< max x^'y = Ai(^') , 

- ||bx|| = !|y||=l -^ '^ ^ ' 

where the second equality uses the fact that x and y are orthogonal to the vector of all I's and are 
thus orthogonal to every row of J„. 

Since each non-diagonal entry of A' has mean and is bounded in absolute value by 1 -|-p < 2, by 
the result @ of Fiiredi and Komlos, £'[Ai(^')] < 3^/n. Assume that t > Wy/n; Theorem || implies 
then 

Pr[|A2 - ^(A2)| > t] < Pr[Xi{A') > E{Xi{A')) + 1/2] < e"^*' . 

The proof of the case t < 10-^/n is harder and is based on the following two lemmas. 

Lemma 4.2 For every constant p > there exist constants Cp > 0, Kp > so that for any Kp < t < 

10-^/n, there is a positive number et = 0{t^''^ {np)~^l'^) such that 

Pr[^X2-{l+et)\2>t\ <e-^^*' . 



Lemma 4.3 For every constant p > there exists a constant Lp > such that 

E[fi2] - E[X2] < Lp . 



Assuming these two lemmas hold, we can finish the proof as follows. First assume, without loss 
of generality, that t > 5Lp. Consider the upper tail: 

Pr[X2>E{X2) + t] < Pr[f,2>E{X2)+t] 

< Pr[li2>E{fi2) + {t-Lp)\ 

,-n((t-Lp)2) ._ 



< e 



-c„t^ 



by Lemma 4.1 



Now consider the lower tail: 

Pr[X2 < E[X2] - t] < Pr[{l + et)X2<{l + et)E[X2]-t] 

< Pr[fi2 < (1 + et)E[X2] - t/2] + Pr[fi2 - (1 + et)A2 > t/2] . 



By Lemma 4.2 



Pr[iJ2- {I + et)X2> t/2] <€-''''' . 



On the other hand. 



Pr[fi2 < (1 + et)E[X2] - t/2] < Pr[/i2 < (1 + et)^[^2] - t/2] . 

Given that t is sufficiently large, etE[fj,t] = 0{t^''^) < t/A. So, by Lemma 4.1, the last probability can 
also be bounded by e~'^p^ and this completes the proof. □ 



To prove Lemmas 4^ and O we need three other lemmas. The first two (Lemmas 4^ and [l.q ) 
are linear algebraic statements. The last one (Lemma ^^ is a statement about the concentration of 
a certain random variable, which is a function of the entries aij of the random symmetric matrix A. 

Lemma 4.4 Let A be an n by n real symmetric matrix. Let a satisfy < a < yjn. Denote by vi 
a unit eigenvector corresponding to Xi{A). Assume there is a number ci,0 < |ci| < ^/n such that 
||1 — civill < a. Then 

yn — a {^/n — a)^ 
Proof. Note first that 

||civi|| = ||(civi — 1) + 1|| > II 1 II — ||civi — 1|| > "v/n — a . 

Assume that f^2{A) = xAy, where x, y are unit vectors perpendicular to 1. Then 

x(civi) = X, (civi — 1 + 1) = x(civi — 1) < ||x|| • ||civi — 1|| < a . 

Notice that as ||1|| = y/ny and a < y/n, we have ci 7^ 0. Define 

x(civi) 



X = X 



(civi)(civi) 



ClVl 



Then x' is orthogonal to 1 and satisfies ||x'|| < ||x|| = 1. Set u = x — x'. Then 

_ |x(civi)| ^ a 



||civi|| ^yn — a 

Similarly, set 

/ y(civi) 

(CiVi)(CiVi) 

then y' is a vector of norm at most 1 orthogonal to vi. Denoting w = y — y', we can prove that 
||w|| < a/{^yn — a). 

By definition, A2(^) > |x'Ay'|. On the other hand, by the Cauchy-Schwartz inequality 

x^y — X Ay = (x + u)^(y + w) — x Ay = wAx + uAy + u^lw 

< ||vir||Px'|| + ||u||||^y'|| + Ai(yl)||ii||||tt;|| . 

Recall that x',y' are orthogonal to the first eigenvector of A. Therefore, ||x'yl|| and ||Ay'|| are at 
most X2{A). Then 

f ^ ^ ^.^^«^2(v4) , aX2iA) a^Xi{A) 2aA2(A) a^XiiA) 



^Jn — a ^/n — a {y/n — aY ^/n — a (y^ — a)^ ' 
and the lemma follows. □ 

Lemma 4.5 Let A = (aij) be an n by n real symmetric matrix. Assume that s and X are positive 
numbers satisfying A2(^) < s/2 and I]r=i(Z]j=i o-ij ~ ^)'^ ^ ^- Then there is a number ci, |ci| < y/n 
such that II 1 — civill < 2^/X/s, where vi is a unit eigenvector corresponding to Xi{A). 

Proof. . Let vi, . . . , v„ be unit eigenvectors of ^, corresponding to the eigenvalues Ai(A), . . . , An(^), 
respectively. Since these vectors form an orthogonal basis of i?", we can express the vector 1 as their 
linear combination: 

n 

where |ci| < ||1|| = y/n. It is not too difficult to check the following relations: 

J2{\{A) - sfcj = \\{A- sl)lf = J2{JZ %• - ^f ■ 

i=l i=l jr'=l 

By the assumptions of the lemma we get then 

n n n 2 it- 

i=l j=l 1=2 i=2 



Therefore, 



as desired. □ 



II ||2 V^ 2 ^ 4X 

|l-civi|| =}_^c,<-^ 

i=2 * 



Lemma 4.6 Let aij, l<j<i<nbe independent random, variables, uniformly bounded by 1 in 
their absolute values. Assume that for i > j, the atj have a common expectation p. Define Oij = aji 
for j > i. Then there exists an absolute constant c > so that for all t > 1, 



Pr 



Y.lY.'^ij-^p] ^ *^^ 



Proof. For 1 < i < n, let pi = E[aii]. We define 



< e 



-cf 



Yi = (^ Oij - npf , 
i=i 

then Y = Er=i(Ei=i ai, - npf = Eti Y^. 

We first estimate from above the expectation of Yi. Set bij = aij for ah j ^ i, set also ha = 
an + P — Pi- Then E[bij] = p for all 1 < i,j < n. We obtain: 

(n \ n n 

J2i^ij - ^P) + (Pi - P)M = (YI ^ij ~ ^P)'^ + 2(^» -P)Y1 ^ij ~ '^P) + (Pi ~ Pf ■ 

Recall that bij are independent random variables with a common mean p. Therefore 

n 

E[Yi\ = EiY^bi.-npfj + ipi-p)^ 

n n 

= VAR[Y, bij] + (pi -P? = Y. y^RK] + (Pi - P? < n{l -p) + {pi - pf 

3=1 3=1 

< n , 

for large enough n. This implies that E[Y] = J2t=i ^[Yi] < n'^- 

Now, it is easy to see that for every 1 < j < i < n, changing the value of the random variable aij 
can change the value of Y by at most Cij = 0{n) (recall the assumption \aij\ < 1). Then the so called 
"independent bounded difference inequality", proved by applying the Azuma-Hoeffding martingale 
inequality (see,, e.g., |Q]), asserts that for every h > 0, 

Pr[Y - E[Y] >h]< exp{-/iV2 ^ cfj} < exp{-h^ /0{n'^)} . 

l<j<i<n 

Substituting h = (t — l)n^ and using the fact E[Y] < n^, we get the desired bound on the upper tail 

of y. □ 



Proof of Lemma 4.2. Recall that by (0 we have -E[Ai[A] = 0{^/n). From the analysis of the case 
t > W^/Ti it follows then that Pr[X2 > np/2] < e'^^'^'P^'' < e"^*". Also, by Corollary |2]| S[Ai (A)] = 



np 



+ o{n). Combined with our Theorem |l|, this implies that Pr[Xi{A) > 2np] < e~^^'^P' < e"*^* . 
These two facts, together with Lemma ^^, show that for that if t < W^/n, then with probability at 
least 1 — e~^^ , the following three properties hold: 



10 



1- EUiE]=ia,,-npr<nH; 

2. X2{A) <np/2; 

3. Ai(^) <2np. 

Assume that a matrix A satisfies conditions 1, 2 and 3 above. Applying (in this order) Lemma [4.5| 
with X = n?t and s = np and Lemma iA with a = 2X^''^ /s = 2t^'^/p, we have that with probabihty 
at least 1 — e" 



-ct^ 



/ .^ / 2a \ , ^, a^Xi(A) AX 



n — a 



2np 



n — a) 



< 



9t 



s' {^-2^/sY - P 



Substituting the value of a, we get: 

Pr[^,2- (i 



^1 A. > ^] < e-^ 
y/np ) p 



The proof is completed by rescaling, namely, by setting t := t/p. □ 



Proof of Lemma 4.3. First notice that 



E[fi2] - E[X2] = E[fi2 - As] < / tPr[ii2 - X2 > t]dt . 

Jo 



Moreover, 



tPr[fi2 - A2 > t]dt < tdt + 



Kr, 



tPr[fj,2 - X2>t]dt+ / tPr[/i2 > t]dt 

JlOy^ 



The first integral is clearly bounded by a constant depending on pi only. By Lemma 4.1 and the fact 
that E{fi2) < 3y/n, Pr[n2 > t] < e"^*" for t > W^/n. Thus J^^tPr[n2 > t]dt < J^ te-'^^^dt = 0(1). 
To bound the second integral, note that 



"lOV" 



nlOV" 



/ tPr[^2 - >^2 > t]dt < / tPr[fj,2- X2>t/2 + €tX2]dt 

JKp JKp 

+ / tPr[etX2 > t/2]dt. 

Jk„ 



By Lemma 4.2 



"lOv^ 



lO^n 



/ tPr[n2- h>t/2 + etX2]dt< te'""^ dt = h , 

JKp Jo 

where Zi > is a constant depending only on p. 

On the other hand, we know that et < bt^''^{np)~^''^ for some constant b. Using the analysis of 
the case t > W^/n, assume that Kp > (306/p)^; for any K <t < W^/n we have: 

Pr[etX2 > t/2] < Pr[X2 > ^-i^iripf'^] < e-'^'" . 



11 



This implies that 

/ tPr[etX2 > t/2]dt < / te'"^* dt = I 

JKr, Jo 



where I2 is a constant depending on p only. This completes the proof. □ 

The proof for 62 is similar. Instead of /i2, consider /i2 = max^ ijx=i||,xi=o^^^- Again, using 



Talagrand's inequality one can obtain a version of Lemma 4.1 for //2- The rest of the proof is similar 
and we omit the details. 

5 Concluding remarks 

• Unfortunately, we are unable to extend Theorem ^ to the case when the expectation p of the 
non-diagonal entries of a random matrix An is a function of n and tends to zero as n tends 
to infinity, without imposing additional restrictions of the distribution of entries. However, 
Theorem 2 can be extended in the following special but important case: the diagonal entries of 
An = {o-ij) are all zeroes, and the entries above the main diagonal are i.i.d. Bernoulli random 
variables with parameter p = p{n), i.e., Pr[aij = 1] = p and Pr[aij = 0] = 1 — p for all 
^ ^ i < j ^ n. In this case the random matrix An can be identified with the adjacency matrix 
of a random graph G{n,p), and the eigenvalues of An are the eigenvalues of a random graph on 
n vertices. Under these assumptions we have the following result. 

Theorem 3 There are positive constants c and K such that if p = ijj{n~^) then for any t > K, 

Pr[\X2{A)-E[X2{A)]\>t]<e-''\ 
where A is the adjacency matrix of G{n,p). The same result holds for 62{A). 

This theorem can be proved by repeating the arguments in the proof of Theorem 2 under 
the new assumptions. We have to make some significant changes only in the proof of Lemma 
4.6. The method of bounded difference martingale (Azuma-Hoeffding's inequality) seems not 
powerful enough to prove the the statement Lemma 4.6 when p is decreasing in n, and we need 
to invoke a recent concentration technique presented in [O]. The details are omitted. Notice 
that in many graph theoretic applications the eigenvalue X2{A(G)) is of special importance as 
it reflects such graph properties as expansion, convergence of a random walk to the stationary 
distribution etc. 

• Though we could show the tightness of our main result (Theorem ||) in some cases and for 
some values of the deviation parameter t, it will be extremely interesting to reach a deeper 
understanding of the tightness of Theorem |l] for the whole range of t and for some particular 
important distributions of the entries of A. 
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• Theorem |^ is obtained under very general assumptions on the distribution of the entries of a 
symmetric matrix A. Still, it will be very desirable to generalize our result even further, in 
particular, dropping or weakening the restrictive assumption about the uniform boundness of 
the entries of A. This task however may require completely different tools as the Talagrand 
inequality appears to be suited for the case of bounded random variables. 

• Finally, it would be quite interesting to find further applications of our concentration results in 
algorithmic problems on graphs. The ability to compute the eigenvalues of a graph in polynomial 
time combined with an understanding of potentially rich structural information encoded by the 
eigenvalues can certainly provide a basis for new algorithmic results exploiting eigenvalues of 
graphs and their concentration. 
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